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ABSTRACT : 

PURPOSE: To obtain a gene of a newly elucidated hepatitis C virus (HCV) , 
a specific oligonucleotide, a method for detection thereof and a method 
for determining the genotype of the HCV. 

CONSTITUTION: The polynucleotide has a base sequence described in 
sequence Nos . 1 to 5. The oligonucleotide is described in sequence No 6. 
The methods for detecting an HCV gene and determining the genotype use 
the polynucleotide and oligonucleotide as a primer, a probe, etc. 
Furthermore, the polypeptide is described in sequence Nos. 15 to 19. 
Thereby, the HCV gene of the newly found genotype can be detected and the 
HCV genotype Can simultaneously be determined over a wide range. 
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[ st** 7 ] stgtigs 6 jsi sum u ^ ? i^* f 

[iS*JS8] SWHM*6£tt<0«aS5>Jt*rs*yd' 

*?M-?-F#3 2 1. 
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SfipaWT*4ii:**L"C^6. El, E 2/N SI 

H^Stf^**. HCVy^f-yKfsM-t&^k** 

^itNS2~NS5Wlirnf7-b'f^ 50 



^§§¥6-3 1 9 563 

6 
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swwaa+ottWttifc (^si¥2- 15340 
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2 3200) fcriOIICVJKfejWWB^ii 
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6. 4-093960) tli* Wjfifir^P i5£**?* 
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7. fcJ:t/SR03 7) *»^>»:<OJ:^(cL-CRNASrtt 
HlLfc. JlUniSOju HcaSAWh'J^IMRK ( l 0m 
M, pH8. 0)*aPi, 90X 1 0 3 r pmlZX 1 5 
»BO»IMJ««rff->fc. #4>iXJt^.P v MC2 0 0m 
MONaCK lOmMEDTA, 2% (MM./^) 
WF^WaBW-MJ^A (SDS) k lmg/m 160 
70ft-b'K Sr-g-tf F D *ifcSffi?8t ( 5 OmM, p H 

8. 0) fcjB*. 6 0 , €-ClB#^MflL, 7i/-*/ 
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T^i^ay (PCR) (CjifitMi 
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iZttmmtoZ. &llZ*1-&ffi.timtZl!k'felt:±>A *£[Sc i ence. Vo l. 239. P487-49 
mV^^^U^Vry^^-^^mzry^-tyx 1 (1988) ] lzft->X3 5Mcoimy4 ?)Wt>% 



-Elraer-Cetus)»TSaiki^* [^1] 
: 1 

£*10:£3 : 94 87 

K5»J^S^ : cDNA to genomic R N A (HC-G9 CDNA) 

GCCAGCCCCC TGA1GGGCGC GACACTCCGC CATGAATCAC TCCCCT6TGA GGAACTACTG 60 
TCTTCACGCA GAAAGCGTCT AGCCA7GGCG TTAGTATGAG TGTCGTGCAG CCTCCAGGAC 120 
CCCCCCTCCC GGGAGAGCCA TAGTGGTCTG CGGAACCGGT GAGTACACCG GAATTGCCAG 180 
GACGACCGGG TCCTTTCTTG GATTAACCCG CTCAATGCCT GGAGA1TTGG GCGTGCCCCC 240 
GCAAGACIGC TAGCCGAGTA GTGTTGGGTC GCGAAAGGCC TTGTGGTACT GCCTGATAGG 300 
GTGCTTGCGA GTGCCCCGGG AGGTCTCGTA GACCGTGCAC CATGAGCACG AATCCTAAAC 360 
CTCAAAGAAA AACCAAACGT AACACCAACC GCCGCCCACA GGACGTTAAG T7CCCGGGTG 420 
GCGGCCAGAT CGfTGGCGGA GTTTACTTGT TGCCGCGCAG GGGCCCCAGA GTGGGTGTGC 480 
GCGCGACGAG GAAGACTTCC GAGCGGTCGC AACCTCGCGG GAGGCGTCAG CCTATTCCCA 540 
AGGCCCGCCG ACCCGAGGGA AGGTCCTGGG CGCAGCCCGG GTACCCTrGG CCCCTCTATG €00 
GCAACGAGGG CTGTGGGTGG GCGGGATGGC TCCTGTCCCC CCGCGGCTCT CGGCCTAGTT 660 
GGGGCCCTTC TGACCCCCGG CGGAGGTCAC GCAATTfGGG TAAGGTCAFC GATACCCTCA 720 
CGTGTGGCTT CGCCGACCTC ATGGGGTACA TCCCGCTCGT CGGCGCTCCT CTAGGGGGCG 780 
CTGCCAGAGC TCTGGCACAT GGTGTTAGAG TCCTGGAAGA CGGCGTGAAT TACGCAACAG 840 
GGAACCTCCC CGGUGCTCT TTTTCTATCT TCITGCTCGC TCTTCTATCC TGCCTGACAG 900 
TCCCTGCTTC GGCC6TCGGA GTGCGCAACT CTTCGGGGGf GTACCATGTC ACCAATGATT 960 
GCCCCAATGC GTCCGTTGTG TACGAGACGG AGAACCTGAT CATGCATCTG CCCGGGTGTG 1020 
TGCCCTACGT ACGCGAGGGC AACGCCTCGA GGTGTTGGGT CTCCCFTAGT CCCACCGTAG 1080 
CCGCCAGGGA HCGCGCGTC CCCGTCAGTG AGGTTCGGCG TCGTGTCGAC TCGATTGTCG 1140 
GGGCCGCTGC GTTC7GTTCG GCTATGTATG TAGGGGACCT ATGCGGCTCC AICTTCCTTG 1200 
TTGGCCAGAT CTrCACttTC TCTCCCAGGC ACCATTGGAC GACGCAAGAC FGCAATTGCT 1260 
CCATCTACCC AGGCCATGTG ACAGGTCATC GAATGGCTTG GGACATGATC ATGAATTGGf 1320 

[0023] (4 ) cDNA^y^y-COfS^KJ:?. KVUVJilrYtiA (New England 

HC-G9, YS 1 1 7, fcJ;tfSR0 3 7<9iS£@2?l| Biolabs), T4 DNA;KLM?~ (Tak 
(O^S ara B i o c h em i c a 1 s ) tSfflfl M 1 3 

PCRlZX®mLfc&mtk&*<D&ffi®miirf-£T 4tfXW ?-KJ$AU ?n->itltz. ig^l^I 
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rmac i a) fcfflWCfr-jfc. fcSWfclCOkvC. %M 

U-ClOTSiftJgtfc. H C - G 9 >-Ctt^«« 
£. YS 1 1 7^(5y(CSR0 3 7tCO^T{i5' JgJ; 10 
0 6 3#BO*£*>6 1 8 6 7*@c7)Jg»2 TWiWIfc 
8259*B0»4>944O#Ii TOKWfcoV vt «Jg 
Ltl. BW*HtCHC-G9, SJW2JCYS1 17, 
tf>5' H. KW«3tYSl 1 7, <03' fflL Sftft4 
CSRO37055' III. BJ>J*5KSR0 3 7OffliBS 
WO 3' »3*fflSI»«lfcv"*. «2fcHC-G9.Y 

si 1 7, sR0 3 7^>iwiiHo«Rtt?&*r. 
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fE^3««:OHCVIi^-Ojag^St:^Sill.. 0 
2 CH C - G 9 fc £il*?fc£W«ffl«fiHjMiraj3*i 

r^i.i4c7)Hcv^i:otsaE5"jcofflisitt$-s-ro * 
disk, h c - g 9 fcmi^«»*sieF«««>sBi 

EH* *> 2 OKJ2U:0>#ffiHtt£ Ufr*\I fc ifim I 
fc. t*^HC-G9?r^tf3«i«c**c7)HCV 

i4a3iw*ensfctti«:*ifi,i«effifciMRSft 

<£ 0 . »rtr3*utttfis?ffltt** < 1 Mt 

TOU#8KJKU ^O^S^ti-f TC 1 a. 1 im 

iiifcftoTlcatWgWKiifJiitfcL 

[0024] 
[023 
[0025] 
[«2] 
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CACCCACTG6 CGCCTTAGTG GTCGCACAGC TACTCCGGAT CCCACAAGCT ATCGTG6ATA 1360 

rGATAGCTGG TGCCCACIGG GGTGTCCTAG CGGGCCTGGC ATACTACTCC ATGGTG6GGA 1440 
ACTGGGCTAA GGTTGTGGTC GTGCTGCTGC TCTTCGCTGG CGTT6ACGCA GAGACCCGGG 1500 
rCACAGGGGG GGCCGCTGGC CACACCGCGT TCGGGTTIGC TAGCTTCCTC GCCCCAGGCG 1560 
CTAAGCAAAA GATCCAGCTC ATAAATACCA ACGGCAGCTG GCACATCAAC AGAACTGCCT 1620 
TGAACTGTAA TGAAAGCFTG GATACTGGCT 6GCTAGCAGG GCT6CTCTAC IACCACAAGT 1680 
TCAACTCCTC AGGGTGTCCC GAGAGGATGG CTAGTTOCCA ACCTCTTACC GCCTTCGACC 1740 
AAGGGTGGGG ACCCATCACT CACGAGGGGA ATGCTAGTGA TGACCAGCGG CCATATTGTT 1500 
GGCACTATGC CCTACGCCCG TGTGGCATTG TGCCAGCGAA AAAGGTTTGC GGGCCTGTAT 1860 
ACrGTTTCAC ACCCAGCCCC GTGGTAGTGG GGACGACGGA CAGAGCCGGC GTTCCTACCT 1920 
ACAGATGGGG TGCCAATGAG ACGGATGTAC TGCTCCTCAA CAACTCTAGG CCGCCAATGG 1980 
GGAATTGGTT TGGGTGTACG TGGATGAATT CTAG1GGCT7 CACCAAGACG TGCGGGGCTC 2040 
CGGCCTGCAA CATCGGCGGG AGCGGGAACA ATACCCTGCT GTGCCCAACA GATTGCTTCC 2100 
GTAAACATCC GGATGCCACA TACAGCAGGT GCGGCTCTGG TCCCIGGCTT ACCCCTCGAT 2160 
GCTTGGTAGA CTACCCATAC AGGCTCTGGC ACTACCCCTG TACAGTCAAI TACACCATTT 2220 
TCAAGATCAG GATGT7TGTG GGCGGGGTTG AGCACAGGCT TGACGCCGCG TGCAACTGGA 2280 
CGCGGGGAGA GCGCTGCGAT TTGGACGACA GGGATCGGGC CGAGTTGAGC CCTCTGTTGC 2340 
TGTCCACTAC GCAATGGCAG GTCCTCCCCT GCTCATTCAC AACACTGCCC GCCCTGTCAA 2400 
CTGGCCTGAT ACATCTCCAC CAGAACATCG TGGACGTGCA GTACCTCTAT GGGTTGAGCT 2460 
CGGCAGTCAC ATCCTGGGTC ATAAAGTGGG AGTACGTTGT GCTCCTCTTC TTGCTGCTAG 2520 
CAGATGCTCG CATTTG7CCC TGCITGTGGA TGATGCTTCT CATATCTCAG GTAGAGACGG 2560 
CGCTGGAGAA CTTGATAGTT CTCAACGCTG CTTCCCTAGT CGGGACACAT GGCATCGTCC 2640 
CCTTCTTCAT CTTTTTUGT GCAGCTTGGT ACCTAAAAGG CAAGTGGGCC CCTGGACTCG 2700 
CCTATTCCGT CTATGGGATG TGGCCACTGC TCCFGCTTCT CCTGGCGTTG CCCCAACGG6 2760 
CATACGCCTT GGATCAGGAG HGGCCGCGT CGTGTGGGGC CACGGICTTC ATCTGCCTAG 2820 
CGGTGCTCAC TCTATCGCCA TATTACAAAC AGTACATGGC CCGCGGCATC TGGTGGCTGC 2880 
AGTACATGCT GACCAGAGCA GAGGCGCTCC TACAGGTTTG GGTCCCCCCG CTCAACGCCC 2940 
GACGAGGGCC CGACGGAGTC GTACTGCTCA CCIGTGTGCT CCACCCGCAC TTGCTCTTTG 3000 
AAATCACCAA GATCATGCTG GCCATTCTCG GGCCTTTGTG GATCTTGCAG GCCAGTCTGC 3060 
TCAAGGTACC GTACTTCGTG CGTGCCCACG GTCTCATTAG GCTCTGCATG CTGGTGCGCA 3120 
AGACAGCCGG CGGTCAGTAT GTGCAGATGG CTCTGTTAAA GCTGGGAGCA TTTGCCGGCA 3180 
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15 16 
CCTACAT7TA CAACCACCTT TCCCCGCrCC AAGAC7GGGC 7CACAGC6G7 CTCCGCGACC 3240 
7GGCGGTAGC CAC7GAACCC fiTCATCTTCT CCCGGA7GGA AATCAAGACT ATCACCTGGG 3300 
GQGCGGATAC TGCGGCTTGT GGAGACATCA TCAACGGGCT GCCTGTTTCC GCCCGGAGAG 3360 
GGAGAGA6GT G77GCTGGGA CCAGCCGATG CCCTGACTGA CAAAGGATGG AGGCTTTTAG 3420 
CCCCCATCAC GGCTTACGCC CAGCA6ACAC GGGGTCTCTT GGGCTGCATC ATCACCAGCC 3480 
TCACCGGTCG GGACAAAAAT CAAGTGGAGG GGGAAGTCCA GATTGTGTCT ACCGCAACCC 3540 
AGACGT7CTT GGCTACTTGT GTTAATGGAG TTTGCTGGAC TGTGTATCAT QGGGCCGGAT 3600 
CGAGGACCA7 CGCTTCGGCG TCGGOCCCTG TGATCCAGAT GTACACTAAT GTGGACCAGG 3660 
A777GG7GGG CTGGCCAGCG CCTCAGGGAG CGCGCTCCCT GACGCCG7GC ACATGCGGCG 3720 
CCTCGGATCT GTAC7TGG7C ACGAGGCACG CGGACGTCAF TCCAGTGCGG CGTCGGGGCG 3780 
ATAACAGGGG AAGTTTACTA TCTCCCCGGC CAATTTCATA TCTAAAGGGA fCCTCGGGAG 3840 
GCCCCCTGCT C7G7CCCA7G GGACA7GCCG TGGGCATTTT CAGGGCCGCG 67G7GCACCC 3900 
GTGGGGTCGC AAAGGCGGTC GACTTTGTGC CCGTTGAATC CCTAGAGACC ACCATGAGGT 3960 
CCCCAGTGTT TACCGACAAT TCCAGCCCTC CGACAGTGCC CCAGAGC7AC CAGGTGGCGC 4020 
ATCTGCACGC TCCCACTGCA AGTGGTAAGA GCACGAAGGT GCCGGCCGCC TATGCGGCTC 4080 
AAGGGTACAA GGTTCTTGTG CTGAACCCGT CTGTTGCTGC CACCCTAGGG TTCGGCGCTT 4740 
ATATGTCAAA GGCCCATGGG ATTGACCCAA ACGTCAGGAC TGGCGTAAGG ACCATTACCA 4200 
CAGCCTCCCC CATCACCCAC TCCACC7ACG GCAAATTCCT GGCTGACGGT GGGTGTTCAG 4260 
GAGGTGCGTA [GACATCATA ATATGTGACG AATGTCACTC AG7GGACGCC ACCTCGARC 4320 
TAGGCATAGG GACTGTCTTG GACCAAGCGG AGACAGCGGG GGTTAGGCTC ACIATCCTCG 4380 
CCACCGCTAC ACCACCTGGC TCCGTCACCG TGCCACArTC CAACATCGAG GAAGTTGCAT 4440 
TGTCCACTGA GGGGGAGATA CCATTCTATG GTAAGGCCAT CCCCCTAAAT TACATCAAGG 4500 
GGGGGAGGCA TCTCATfirC TCTCATTCCA AGAAGAAGTG CCACGAGCTC GCTGCAAAGC 4560 
TGGTTGGCCT GGGCGTCAAC GCAGTGGCCT 777ACCGCGG CCTC6ACGTG TCTOTCATCC 4620 
CAACCACAGG AGACG7CG77 GTTGTGGCGA CCGACGCCir AATGACTGGC TACACCGCCG 4680 
AT77CGAC7C CGTGATAGAC TGCAACACCT GTGTCGTCCA GACAGTCGAT TTCAGCCTAG 4740 
ACCCfACATT CTCTA7TGAG AC7TCCACCG TGCCCCAGGA CGCCG7G7CC CGC7CCCAAC 4800 
CGAGAGG7AG AACCGGTCGG GGGAAGCA7G GTA7C7ACAG A7A76TCTCA CCCGGGGAGC 4860 
GGCCGTC7GG CA7G7T7GAC TCCC7GGrCC 7CTG7GAG7G C7A7GACGCG GG77G7GCT7 4920 
GG7ATGAGCT 7ACACCCGCC GAGACCACGG 77AGG77ACG GGCA7ATC77 AACACCCCAG 4980 
GG77GCCCG7 G7GCCAGGAC CAC77GGAG7 777GGGAGAG CG7C77CACC GGCC7CACCC 5040 

[oo3i] 40 * mw&ztmx § * . *imz * o«fct»3es*ifc3r 

u&iwnski ft*4^<offitiMB3fiTvvtHc vco 'jd^^u^rry^^-imm-r^ztx-. mm 

h&^m$mmimzmm.tt^h*v cm.ncv <m\iFmm. 9M4$Mw&fiMg?>ji5g 

?<omiiikM&?u'<.i\scr>mmmzmmx'$&. ttM [003-21 
iktcmmzMLbzuz^y'u-vtLxm&iz. & immi 
^m^m^tm^hM^ztx-mm^uouco* 50 
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EW*^ : 1 

WM<T>g:% : 9 4 8 7 

wm*)M : tm 

m<r>& : -*« 

U¥\<r>m& : c D N A to genomic R N A (HC-G9 CDNA) 

GCCAGCCCCC TGATGGGGGC GACACTCCGC CATGAATCAC TCCCCTGTGA GGAACTACTG 60 
TCrTCACGCA GAAAGCGTCT AGCCATGGCG TTAGTATGAG TGTCGTGCAG CCTCCAGGAC 120 
CCCCCCTCCC GGGAGAGCCA TAGTGGTCTG CGGAACCGGT GAGTACACCG GAATTGCCAG 180 
GACGACCGGG TCCTTTCTTG GATTAACCCG CTCAATGCCT GGAGATTTGG GCGTGCCCCC 240 
GCAAGACTGC TAQCC6AGTA GTGTTGGGTC GCGAAAGGCC TTGTG6TACT GCCTGATAGG 300 
GTGCT7GCGA GTGCCCCGGG AGGTCTCGTA GACCGTGCAC CArGAGCACG AATCCTAAAC 360 
CTCAAAGAAA AACCAAACGT AACACCAACC GCCGCCCACA GGACG1TAAG T7CCCGGGTG 420 
GCCGCCAGAT CGTTGGCGGA GTTTACTTGT TGCCGCGCAG GGGCCCCAGA GTGGGTGTGC 430 
GCGCGACGAG GAAGACTTCC GAGCGGTCGC AACCTCGCGG GAGGCGTCAG CCTATTCCCA 540 
AGGCCCGCCG ACCCGAGGGA A6GTCCTGGG CGCAGCCCGG GTACCCTFGG CCCCTCTATG 600 
GCAACGAGGG CTGTGGGTCG GCGGGATGGC TCCTGTCCCC CCGCGGCfCT CGGCCTAGTT 660 
GGGGCCCTTC TGACCCCCGG CGGAGGTCAC GCAATUGGG TAAGGTCAFC GATACCCTCA 720 
CGTGTGGCIT CGCCGACCTC ATGGGGTACA FCCCGCTCGT CGGCGCTCCT CTAGGGGGCG 750 
CTGCCAGAGC TCTGGCACAT GGTGTTAGAG TCCTGGAAGA CGGCGTGAAT TACGCAACAG 840 
GGAACCTCCC CGGTTGCTCT TTTTCTATCT TCFTGCTCGC TCTTCTATCC T6CCTGACAG 900 
TCCCTGCTTC GGCCGfCGGA GTGCGCAACT CTfCGGGGGF GTACCATGTC ACCAATGATT 960 
GCCCCAArGC GTCCGTTGTG TACGAGACGG AGAACCTGAT CATGCATCTG CCCGGGTGTG 1020 
IGCCCTACGT ACGCGAGGGC AACGCCTCGA GGTGTTGGGr CTCCCrTAGT CCCACCGTAG 1080 
CCGCCAGGGA TTCGCGCGTC CCCGTCAGTG AGGTTCGGCG TCGTGfCGAC TCGATTGTCG 1140 
GGGCCGCTGC GTTCTGTTCG GCTATGTATG TAGGGGACCT ATGCGGCTCC ATCTTCCITG 1200 
TTGGCCA6AT CirCACCrTC TCTCCCAGGC ACCATTGGAC GACGCAAGAC TGCAATTGCT 1260 
CCATCTACCC AGGCCATGTG ACAGGTCATC GAATGGCTTG GGACATGATC ATGAATTGGr 1320 
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CACCCAC76G CGCCTTAGTG GT6GCACAGC TACTCCGCAT CCCACAAGCT ATCGTGGATA 1MO 
7GATAGC76G TGCCCACTGG GGTGTCCTAG CGGGCCTGGC ATACTACTCC ATGGTGGGGA 1440 
ACTQGGCTAA GGTTGTGGTC GTGC7GC7GC TCTTCGCTGfi CGTTGACGCA GAGACCCGGG 1500 
TCACAGGGCG GGCCGCTGGC CACACCGCGT TCGGGTTIGC TAGCTTCCTC GCCCCAGGCG 1560 
CTAAGCAAAA GATCCAGCTC ATAAATACCA ACGGCAGCTG GCACATCAAC AGAACTGCCT 1620 
TGAACTGTAA TGAAAGCFTG GATACTGGCT GGCTAGCAGG GCTGCTCTAC TACCACAAGT 1680 
TCAACTCCTC AGGGIGTCCC GAGAGGATGG CIAGTTGCCA ACCTCTTACC GCCTTC6ACC 1740 
AAGQGTGGGG ACCCATCACT CACGAGGGGA ATGCTAGTGA TGACCAGCGG CCATATTGTT 1800 
GGCACTATGC CCTACGCCCG 7GTGGCATTG TGCCAGCGAA AAAGGTT7GC GGGCCTGTAT 1*60 
ACrGTTTCAC ACCCA6CCCC GTGGTAGTGG GGACGACGGA CAGAGCCCGC GTTCCTACCT 1920 
ACAGAIGGGG TGCCAATGAG ACGGATGTAC TGCTCCTCAA CAACTCTAGG CCGCCAATGG 1380 
GGAATTGGTT TGGGTGTACG TGGATGAATT CTAGTGGCTT CACCAAGACG TGCGCGGCTC 2040 
CGGCCTGCAA CATCGGCGGG AGCGGGAACA ATACCCTGCT GTGCCCAACA GATTGCTTCC 2100 
GTAAACATCC GGATGCCACA TACAGCAGG7 GCGGCTCTGG TCCCTGGCTT ACCCCTCGAT 2160 
GCTTGGTAGA CTACCCATAC AGGCTCTGGC ACTACCCCTG TACAGTCAAT TACACCATTT 2220 
TCAAGATCAG GATGTTTGTG GGCGGGGT7G AGCACAGGC7 7GACGCCGCG 7GCAAC7GGA 2280 
CGCGGGGAGA GCGC7GCGAT T7GGACGACA GGGA7CGGGC CGAG77GAGC CC7C7G77GC 2340 
TGTCCAC7AC GCAA7GGCAG GTCCTCCCCT GC7CA77CAC AACACTGCCC GCCC7G7CAA 2400 
CT6GCCTGAT ACA7CTCCAC CAGAACATCG 7GGACG7GCA G7ACC7C7A7 GGG7TGAGC7 2460 
CGGCAG7CAC A7CC7GGGTC A7AAAG7GGG AG7ACG77G7 GC7CC7CT7C irGCTGC7AG 2520 
CAGAIGC7CG CA77TG7GCC 7GCITG7GGA TGATGCTTCT CA7A7C7CAG G7AGAG0CGG 2580 
CGC7GGAGAA C77GA7AG17 CTCAACGC7G C7TCCC7AG7 CGGGACACA7 GGCATCG7CC 2640 
CCTTCTTCAT C7T77T77G7 GCAGC77GG7 ACCTAAAAGG CAAGTGGGCC CCTGGAC7CG 2700 
CC7A77CCG7 C7A7GGGA7G 7GGCCAC7GC TCCFGCTTCT CC7GGCG77G CCCCAACGGG 2760 
CA7ACGCC77 GGA7CAGCAG 77GGCCGCG7 CG7G7GGGGC CACGG7C77C AICTGCC7AG 2820 
CGG7GC7CAC 7C7A7CGCCA 7A77ACAAAC AG7ACA7GGC CCGCGGCA7C TGG7GGC7GC 2880 
AG7ACA7GCT GACCAGAGCA GAGGCGCTCC 7ACAGG7TTG GG7CCCCCCG C7CAACGCCC 2940 
GACGAGGGCC CGACGGAG7C G7AC7GCTCA CGTG7G7GC7 CCACCCGCAC T7GCTC777G 3OO0 
AAA7CACCAA GA7CATGC7G GCCA77C7CG GGCC7T7G7G GA7C77GCAG GCCAGTCTGC 3060 
7CAAGG7ACC G7AC7TCGTG CG7GCCCACG G7C7CA77AG GCTCTGCA7G C7GGTGCGCA 3120 
AGACAGCGGG CGG7CAGTAT GTGCAGA7GG C7C7GTTAAA GCTGGGAGCA 777GCCGGCA 3180 
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CCTACAITTA CAACCACCTT TCCCCGCTCC AAGACTGG6C TCACAGC6GT CTGCGCGACC 3240 
TGGCGGTAGC CACTGAACCC GTCATCTTCT CCCGGATGGA AATCAAGACT ATCACCTGGG 3300 
GGGCGGATAC TGC6GCTTGT GGAGACATCA TCAACGGGCT GCCTGTTTCC GCCCGGAGAG 3360 
GGAGAGAGGT GTTGCTGGGA CCAGCCGATG CCCTGACTGA CAAAGGATGG AGGCTTTTAG 3420 
CCCCCATCAC GGCTTACGCC CAGCAGACAC GGGGTCTCTT GGGCTGCATC AICACCAGCC 3480 
TCACCGGTC6 GGACAAAAAT CAAGTGGACG GG6AAGTCCA GATrQTGTCT ACCGCAACCC 3540 
AGACGTTCTT GGCTACTTGT GTTAATGGAG TTTGCTGGAC TGTGTA7CAT GGGGCCGGAT 3600 
CGAGGACCAT CGCTTCG6CG TCG6GCCCTG TGATCCAGAT 6TACACTAAT GTGGACCAGG 3660 
ATTTGGTGGG CTGGCCAGCG CCTCAGGGAG CGCGCTCCCT GACGCCGTGC ACATGCG6CG 3720 
CCTCGGATCT GTACTTGGTC ACGAGGCACG CGGACGTCA7 TCCAGTGCGG CGTCGGGGCG 3780 
ATAACAGGGG AAGTTTACTA TCTCCCCGGC CAATTTCATA TCTAAAGGGA TCCTCGGGAG 3640 
GCCCCCTGCT CTGTCCCATG GGACATGCCG TGGGCATTTT CAGGGCCGCG GTGTGCACCC 3900 
GTG6GGTCGC AAAGGCGGTC GACTTTGTGC CCGTTGAATC CCTAGAGACC ACCATGAGGT 3960 
CCCCAGTGTT TACCGACAAT TCCAGCCCTC CGACAGTGCC CCAGAGCTAC CAGGTGGCGC 4020 
ATCTGCACGC TCCCACTGGA AGTGGTAAGA GCACGAACCT GCCGGCCGCC TATGCGGCTC 4080 
AAGGGTACAA GGTTCTT6TG CTGAACCCGT CTGTTGCTGC CACCCTAGGG TTCGGCGCTT 4140 
ATATGTCAAA GGCCCATGGG ATTGACCCAA ACGTCAGGAC TGGCGTAAGG ACCATTACCA 4200 
CAGGCTCCCC CATCACCCAC TCCACCTACG GCAAATTCCT GGCTGACGGT GGGTGT7CAC 4260 
GAGGTGCGTA TGACATCATA ATATGTGACG AATGTCACTC AGrGGACGCC ACCTCGARC 4320 
TAGGCATAGG GACTGTCTTG GACCAAGCGG AGACAGCGGG GGUAGGCTC ACIATCCTCG 4380 
CCACCGCTAC ACCACCTGGC TCCGTCACCG TGCCACA17C CAACATCGA6 GAAGTTGCAT 4440 
TGTCCACTGA GGGGGAGATA CCAFTCTATG GTAAGGCCAT CCCCCTAAAT TACATCAAGG 4500 
GGGGGAGGCA TCTCATTTTC TGTCA7TCCA AGAAGAAG7G CGACGAGCIC GCTGCAAAGC 4560 
IGGTTGGCCT GGGCGTCAAC GCAGTGGCCT TTTACCGCGG CCTCGACGTG TCTGTCATCC 4620 
CAACCACAGG AGACGTCG7T GTTGTGGCGA CCGACGCCTT AATGACTGGC TACACCGGCG 4680 
ATT7CGACTC CCTGATAGAC TGCAACACCT GTGTCGTCCA GACAGTCGAI TTCAGCCTAG 4740 
ACCCTACATT CTCTATTGAG ACTTCCACCG TGCCCCAGGA CGCCGrGTCC CGCTCCCAAC 4800 
CGAGAGGTAG AACCGCTCGG GGGAAGCATG GTATCTACAG ATATGTGTCA CCCGGGGAGC 4860 
GGCCGTCTGG CATGTT7GAC TCCGTGGICC TCTGrGAGTG CTATGACGCG GGTTGTGCT7 4920 
GGTATGAGCT TACACCCGCC GAGACCACGG 7TAGGTTACG GGCATATCTT AACACCCCAG 4980 
GGTTGCCCGT GTGCCAGGAC CACTTGGAGT TTTGGGAGAG CGTCITCACC GGCCTCACCC 5040 



(13) 8BB¥6-319563 

23 

ACATAGATGC CCACTTCCTG TCICAGACGA AACAGA6CGG GGAAAATTTC CCCTACCTAG 5100 
TCGCATACCA AGCCACCGTG TGCGCTAGAG CTAAAGCTCC TCCCCCGTCA TGGGACCAAA 5160 
TGT6GAAGTG CTTGATACGG CTCAAGCCCA CCCTCACTGG GGCTACCCCC CTACTATACA 5220 
GACTGGGTGG TGTGCAGAAT GAGATCACCC TAACACACCC AATCACCAAfl TACATCATGG 5280 
CTTGTATGTC GGCTGACCTG GAGGTCGTCA CTA6CACGTG GGTGCIGGTG GGCGGCGTCC 5340 
TGGCCQCT7T GGCCGCTTAC TGCCT6TCTA CAGGCAGCGT GGTCATAGTG GGCAGGATAA 5400 
TCCfAAGCGG GAAGCCGGCA GTCATTCCTG ACAGGGAGG7 TC7CTACCGA GAGTTTGATG 5460 
AGA7GGAAGA GTGCGCCGCC CACATCCCCT ACCTTGAGCA GGGGATGCAT TTGGCTGAAC 5520 
AGTTCAAGCA GAAAGCTCTC GGGTTGCTCC AGACAGCATC CAAGCAAGCA GAGACGATCA 5580 
CTCCCGCTGT CCATACCAAT TGflCAGAAAC TCGAQ7CCT7 CTGGGCTAAG CACATGTGGA 5640 
ACTTCGTCAG CGGGATACAA TACCTGGCGG GCC7GTCAAC GCTGCCC6G7 AATCCCGCTA 5700 
TAGCGTCGCT GATGTCGTTT ACAGCCGCGG TGACGAGTCC ACTAACCACC CAGCAAACCC 5760 
7CCTCTTTAA CATCC77GGG QGG7GGGTGG CCGCCCAGC7 7GCCGCCCCA GC7GCCGCCA 5320 
CTGCTT7CG7 CGGCGC7GGT AT7ACCGGCG CTG7CA7CGG CAG7G7GGGC CTAGGGAAGG 5680 
7CC7AG7GGA CATTC77GCT GGC7ACGGGG CTGG7G7GGC GGGGGCCCT7 GTGGC777CA 5940 
AGA7CA7GAG CGGGGAGGCC CCCACCGCCG AGGA7C7AGT CAACC77CTG CC7GCCA7CC 6000 
7C7CGCCAGG AGCfCTCGTT GTAGGCG7GG TG7GCGCAGC AA7ACTACGC CGGCACG7GG 6060 
GCCC7GGCGA GGGCGCGG7G CAGTGGA7GA ACCGAC7GAT AGCG7TTGCT TCICGGGC1A 6120 
ACCACGTC7C CCC7ACACAC 7A7GTGCCAG AGAGCGACGC G7CAGTCCG7 GTCACACATA 6180 
7CCTCACCAG CC7CAC7G7C AC7CAGC7CC TGAAAAGGC7 CCACG7G7GG ATAAGC7CAG 6240 
ATTGCACCGC CCCGTG7GC7 GG7TCT7GGC TCAAAGATGT CTGGGAC7GG A7ATGCGAGG 6300 
TGCTGAGCGA CUCAAGAGT TGGC7GAAGG CCAAACT7A7 GCCGCAACTG CCCGGGATCC 6360 
CA7TCGTA7C C7GTCAACGC GGG7ACCGTG GGG7CTGGCG GGGCGAAGGC ATCATGCACG 6420 
CCCGT7GCCC GTGTGGAGCC GA7A7AAC7G G7CA7G7CAA AAACGG77CG A7GAGAA7CG 6480 
TCGGCCCTAA GAC7TGCAGC AACACC7GGC G7GGG7CG7T CCCCATCAAC GCCCACAC7A 6540 
CGGGCCC77G CACACCC7CC CCAGCGCCGA ACTACACG7T CGCG7TA7GG AGGG7G7CGG 6600 
CAGAGGAG7A 7GTGGAGG7A AGGCGGCfGG GGGA777CCA 77ACA7CACG GGGG7GACCA 6660 
C7GATAAGA7 CAAGTG7CCA TGCCAGGTCC CCTCGCCCGA G77C77CACA GAGG7GGA7G 6720 
GGG7GCGCC7 ACA7AGG7AC GCCCCCCCC7 GCAAACCCCT GCTACGGGA7 GAGGTGACGT 6780 
77AGCA7CGG GCTCAA7GAA TACTTGGTGG GGTCCCAG77 GCCCTGCGAG CCCGAGCCAG 6840 
ACGTAGC7G7 ACTGACATCA AFGCTTACAG ACCCCTCCCA CA7CACTGCA GAGACGGCGG 6900 
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CGCGTAGACT GAATCGCGGG ICTCCCCCCT CCCFGGCTAG CTCnCTGCC AGCCAATTGT 6960 
CTGCGCCGTC CCTGAAAGCA ACATGTACCA CCCACCATGA CTCTCCAGAC GCTGACCTCA 7020 
TAACAGCCAA CCTCCTGTGG AGGCAGGAGA TGGGGGGGAA CATTACCAGA GTGGAGTCGG 7080 
AGAATAAGAT CGfCATCCTG GATTCTTTCG ACCCGCTCGT GGCGGAGGAG GATGATCGGG 7140 
AGATTTCTGT CCCAGCTGAG ATTCTGCTGA AGTCTAAGAA GTTTCCCCCC GCCATGCCTA 7200 
TATGGGCACG GCCAGATTAT AATCCTCCCC TTGTGGAACC ATGGAAGCGC CCGGACTACG 7260 
AACCACCCTT AGTTCACGGG TGCCCCCTAC CACCTCCCAA ACCAACTCCG GTGCCGCCAC 7320 
CCCGGAGAAA GAGGACAGTG GTGCTGGATG AGICTACAGT ATCATCTGCT CTGGCIGAGC 7360 
TTGCCACTAA GACCTTTGGC AGCTCTACAA CCTCAGCCGI GACAAGTGGT GAAGCGGCCG 7440 
AATCGTCCCC GGCGCCTTCC TGCGACGGTG AACTGGACTC CGAAGCTGAA TCTTACTCCT 7500 
CCATGCCCCC TCTCGAGGGG GAACCGGGGG ACCCCGATCT CAGCGACGGG TCTTGGTCTA 7560 
CCGTAAGCAG T6ATGGCGGT ACGGAGGACG TCGTGTGCTG CTGGATGTCC TACTCGTGGA 7620 
CGGGCGCCCT AATTACGCCC TGTGCCGCAG AGGAAACCAA ACTCCCCATC AACGCACTGA 7660 
GTAACTCGCT GCTGCGCCAC CACAATTTGG TGTATTCCAC CACCTCTCGC AGCGCTGGCC 7740 
AGAGGCAGAA AAAAGTCACA TTTGACAGGC TGCAGGTCCT GGACGATCAT TACCGGGACG 7600 
TGCTCAAGGA GGCTAAGGCC AAGGCATCCA CAGTGAAGGC TAAATTGCTA TCCGTAGAGG 7660 
AGGCATGTAG CCFGACGCCC CCGCACTCCG CCAGATCAAA ATTTGGCIAT GGGGCGAAGG 7920 
ATGTCCGAAG CCATTCCAGT AAGGCCATAC GCCACATTAA CTCCGTGTGG CAAGACCTTC 7960 
TGGAGGACAA TACAACACCT ATAGACACTA CCATCATGGC AAAGAATGAG GTCTTCTGCG 8040 
TGAAGCCCGA AAAGGGGGGC CGCAAGCCCG CTCGTCTTAT CGTGTACCCC GACCTGGGAG 8100 
TGCGCGrAIG CGAGAAGAGG GCTTTGTATG ACGFAGTCAA ACAGCTCCCC ATTGCCGTGA 8160 
TGGGAACCTC CTACGGGTTC CAGTACTCAC CAGCGCAGCG GG7CGACTTC CTGCTTAATG 8220 
CGTGGAAATC AAAGAAAAAT CCTATGGGGT TTTCCTATGA CACCCGUGC TTTGACTCGA 8280 
CAGTCACTGA CGCTGATATC CGTACGGAGG AAGACCTCTA CCAATCTTGT GACCTGGTCC 8340 
CTGAGGCCCG CGCGGCCATA AGGTCTCTCA CAGAGAGGCT TTACATCGGG GGCCCGCTTA 8400 
CCAACTCTAA GGGACAAAAC TGCGGCTATC GGCGATGCCG CGCGAGCGGC GTGCTGACCA 8460 
CTAGCTGCGG TAACACCATA ACCTGTTATC TCAAGGCCAG T6CAGCCFGT CGAGCTGCAA 8520 
AGCTCCGGGA CTGCACTATG CTCGTGTGCG GCGACGACCT CGTCGTTATC TGTGAGAGCG 8580 
CCGGTGTCCA GGAGGACGCT GCGAACCTGA GAGCCTTCAC GGAGGCTATG ACCAGGTACT 8640 
CCGCCCCCCC GGGAGACCCG CCTCAACCAG AATACGACTT GGAGCTTATA ACATCTTGTT 8700 
CCTCCAA[GT TTCAGTCGCG CACGACGGCG C7GGCAAAAG GGTCTACFAT CTGACCCGTG 8760 
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ATCCTGAGAC TCCCCTCGCG C6TGCCGCTT GGGAMCAGC AAGACACACT CCAGTGAACT 8820 
CCTGGCTAGG CAACATCATC ATGTTTGCCC CCACTCTGTG GGTACGGAfG GTCCTTATGA 8880 
CCCATTTTTI CTCCATACTC ATAGCCCAGG AACACCTTGA AAAGGCTCTA GATTGTGAAA 8940 
TCFATGGGGC CGTGCACTCC GTCCAACCGT TAGATCTACC TGAAAICATT CAAAGACTCC 9000 
ACGGCCTCAG CGCGTTCTCG CTCCATAGTT ACTCTCCAGG TGAAATCAAT AGGGTGGCTG 9060 
CATGCCTCAG GAAACTTGGG GTTCCGCCCT TGCGAGCTTG GAGACACCGG GCCCGGAGCG 9120 
TCCGCGCCAC ACTCCTATCC CAGGGGGGGA GAGCCGCTAT ATGCGGTAAG TATCTCTTCA 9180 
ACTGGGCGGT GAAAACCAAA CTCAAACTCA CTCCATTACC GTCCGCGTCT CAGTTGGACT 9240 
TGTCCAATTG GTTCAC6GGC GGTTACAGCG GGGGAGACAT TTATCACAGC GTGTCTCATG 9300 
TCCGGCCCCG CTGGTTCJTC TGGTGCCTAC TCCTACTTTC AGTGGGGGTA GGCATTTACC 9360 
TCCTTCCCAA CCGGTAGAGG GTTGGGCAAC CACTCCAGGC CTTTAGGCCC TGTTTAAACA 9420 
CTCCAGGCCT TTAGGCCCCG TTTTTTTTTT ITTTTTTTTT TTTTTTTTTT TTTTTTT7TT 9480 
TTTTTTr 9487 
[0033] 
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R*J« : 2 
WM<»&Z : 1 7 6 5 

mm : -** 

miomm-. cDN A to genomic RNA (VS117-5' CONA) 

CCATGGCGTT AGTA7GAGTG 7CGTGCAGCC TCCAGGACCC CCCCTCCCGG GAGAGCCATA 60 
GTGGTCTGCG GAACCGGTGA GTACACCGGA ATTGCCAGGA CGACCGGG7C CTTTCTTGGA 120 
TCAACCCGCT CAATGCCTGG AGATTTGGGC GTGCCCCCGC AAGACTGCTA GCCGAGTAGT 180 
GTTGGGTCGC GAAAGGCCTT GTGGTACTGC CTGATAGGGT GCTTGCGAG7 GCCCCGGGAG 240 
GTCTCGTAGA CCGTGCACCA TGAGCACAAA TCCrAAACCT CAAAGAAAAA CCAAACGTAA 300 
CACCAACCGC CGCCCACAGG ACGTTAAGTT CCCGGGTGGC GGCCAGATCG TTGGCGGAGT 360 
7TACTTGTTG CCGCGCAGGG GCCCCAGAG7 GGGTGTGCGC GCGACGAGGA AGACTTCCGA 420 
GCGGTCGCAA CCTCGCGGAA GGCGTCAGCC TATCCCCAAG GCCCGCCGAC CCGAGGGTAG 4*0 
GTCCTGGGCG CAGCCCGGG7 ACCCTTGGCC CCTCTATGGC AACGAGGGCT GCGGGTGGGC 540 
GflGATGGCTC CTGTCCCCCC GCGGCTCTCG GCCTAGTTGG GGCCCCAC7G ACCCCCGGCG 600 
GAGG7CACGC AA777GGG7A AGG7CA7CGA TACTC7CACG TGTGGC7TCG CCGACC7CA7 660 
GGGGTACA7C CCGC7CGrCG GTGC7CCTCT AGGGGGCGC7 GCCAGAGC7C FGGCACACGG 720 
7G7rAGAG77 C7GGAAGACG GCGTGAAC7A CGCAACAGGG AACC77CC7G G77GC7CCT7 780 
TTCTATCTTC 7TGC7CGC7C 77C7A7CC7G CC7GACAC7C CCfGC7rCGG CCG7CGAAG7 840 
GCGCAAC7CA 7CAGGGGFGT ACCA7GTCAC CAA7GA77GC CCCAA7GCG7 CCG77G7GTA 900 
CGAGACAGAG AGCC7GATCA 7GCA7C7GCC CGGG7GTGTG CCC7GCG7AC GCGAGGGCAA 960 
CGCC7CGAGG TGCTGGG!"C7 CCCT7AGCCC TACCA77GCC GC7AAGGA7C CGAGCG7CCC 1020 
CG7CAGTGAG AITCGGCG7C ACGTCGAC77 GA7CG7CGGG GCCGCCGCG7 7C7G77CGGC 1080 
7A7G7A7G7A GGGGACC7AT GCGGCTCCA7 CTTCCTCGFT GGCCAGA777 7CACC7TC7C 1140 
7CCCAGGCGC CA7TGGACGA CGCAGGAC7G rAA77GCTCC ATCTACCCGG GCCATGTGAC 1200 
AGGTCATCGA A7GGC77GGG ACA7GATGA7 GAA7TGG7CA CCCACTGGCG CCCTAGTGAF 1260 

GGCGCAGCTA C7CCGGA7CC CACAAGCTGT CGTGGATATG ATAGCCGGTG CCCACTGGCG 1320 
TGTCCTAGCG GGCCTGGCAT ACTACTCCA7 GGTGGGGAAC 7GGGCTAAGG TTG1GGTIGT 1380 
GCT6TTGCTC T7CGCTGGCG TCGACGCGGA CACCCAGGTC ACAGGAGGCA GCGCTGCCTA 1440 
TGATGCGCGC GGACTTGCTT CCCTTTTCAC CCCAGGCCC7 AAGCAAAACA 7CCAGCTCAT 1500 
AAATACCAAC CGCAGCTGGC ACATCAACAG GACCGCCTTG AACTGTAATG AAAGCCTGAA 1560 
CACCGGCTGG G7AGCAGGCC T6TTCTACTA 7CACAAA77C AACTCC7CGG GGTGTCC7GA 1620 
GAGGATGGCT AGTTGCCAGC CCCTCACCGC CTTTGACCAA GGGTGGGGAC CCATCACTTA 1680 
CGAGGGGAAT GCTAGCGGCG ACCAACGGCC ATATTGCTGG CACTATGCCC CACGCCCGTG 1740 
CGGTATTGTG CCGGCGAGAG AGGT7 1765 



(17) #1^6-3 195 63 

3 1 

[0034] 

kw«- : 3 

E*4<0££ .119 1 

fi?^SH: cDNA to genomic RNA (YS117-3' cONA) 

AACA6TCACT GAGGCTGATA TCCGTACGGA GGAAGACCTC TACCAATCTT GTGACCTGGT 60 

CCCCGAGGCC CGCACGGCCA TAAGGTCTCT CACA6AGAGG C7T7ACATCG GGGGCCCCCT 120 

TACCAA7TCC AAGGGACAAA ACTGCGGCTA fCGGCGATGC CGTGCAAGCG GCGTGCTGAC ISO 

CACTAGCTGC GCTAACACCA TAACCTGTTA TCTCAAGGCC AGCGCAGCCT GTCGAGCTGC 240 

AAAGCTCCAG GACTGCACCA TGCTCGTGTG CGGCGACGAC CTCGTCGTTA TCTGTGAGAG 300 

CGCCGGTGTC CAGGAGGACG CTGCGA6C0T GAGAGCCTTC ACGGAGGCTA TGACCAGGTA 360 

CfCCGCCCCC CCGGGAGACC CGCCTCAACC AGAATACGAC TTGGAGCTCA TAACATCCTG 420 

TFCCTCCAAC GTGTCAGTCG CGCACGACGG CTCTGGCAAA AGGG7C7ACT ATCTGACCCG 480 

TGATCCTGAG ACTCCCCTCG CGCGTGCCGC TTGGGAGACA GCAAGACACA CTCCAGTGAA 540 

CFCCTGGCTA GGCAACATCA rCATGTTTGC CCCCACTCTG TGGGTACGGA TGGTTCTTAT 600 

GACCCATTTT TTTTCCATAC FCATAGCCCA GGAGCACCTT GAAAAGGCTC TAGATTGTGA 660 

AATCTATGGA GCCGTACACT CCGTCCAACC GCTGGACCTA CCTGAAATCA TTCAAAGACT 720 

CCACGGCCTC AGCGCGTTTT CGCTCCACAG 77ACTCTCCA GGTGAAA7CA ATAGGGTGGC 760 

7GCATGCCTC AGAAAAC77G GGG7TCCGCC C7TGCGAGC7 TGGAGACACC GGGCCCGGAG 840 

COTCCGCGCC ACAC7CCTA7 CCCAGGGGGG GAAAGC7GCT ATATGCGG7A AG7ACC7C77 900 

CAAC7GGGCG GTGAAAACCA AACTCAAACT CAC7CCA7rA CCG7CCGCG7 CTCAG77GGA %0 

CT7GTCCAA7 7G6T7CACGG GCGGCTACAG CGGGGGAGAC ATT7ATCACA GCGTGTC7CA 1020 
TG7CCGGCCC CGTTGGTrCT TCTGGTGCC7 AC7CCTAC77 TCAG7GGGGG TAGGCA7CTA 1060 
707CC77CCC AACCGA7AGA CGG77GGGCA A7CACTCC7A GCC77TAGGC C7TA77TAAA 1140 
CACTCCAGGC C7TTAGGCCC 7GTTTT7T7T TTTTfTTTTT 7TTT7TTTT7 T 1191 



[003 5] 



(18) #1PF6-3 19563 

£*>J<7)£3 : 1 7 6 5 

m\<r>m : mat. 
mo® : ~*m 

ffiM^SS : cDNA to genomic RNA (SR037-5' cDHA) 

CCA7GGC6T7 AG7A7GAGTG TCGTGCAGCC TCCAGGACCC CCCC7CCCGG GAGAGCCATA 60 
GTGGTCT6CG GAACCGGTGA GTACACCGGA ATTGCCAG6A CGACCGGGTC CTTTCI7GGA 120 
TTAACCCGCT CAATGCCTGG AGA7TTGGGC GTGCCCCCGC AAGACTGCTA GCCGAGTAGT 180 
GTTGGGTCGG GAAAGGCCTT GTGGTACTGC CTGATAGGGT GCTTGCGAGT GCCCCGGGAG 240 
GTCTCGTAGA CCGTGCACCA TGAGCACGAA TCCTAAACCT CAAAGAAAAA CCAAACGTAA 300 
CACCAACCGC CGCCCACAGG ACGTCAAGTT CCCGGGTGGC GGCCAGATCG TTGGCGGAGT 360 
TTACTTGT7G CCGC6CAGGG GCCCCAGAAT GGGTGTGCGC GCGACGAGGA AGACTTCCGA 420 
GCGGTCGCAA CCTCGCGGAA GGCGTCAGCC TATTCCCAAG GCCCGCCGAC CCGAGGG7AG 480 
GTCCTGGGCG CAGCCCGGGT ACCCTTGGCC CCTCTATGGT AACGAGGGCT GTGGGTGGGC 540 
GGGATGGCTT CTGTCCCCCC GCGGT7CCCG GCCTAGTTGG GGCCCCTC7G ACCCCCGGCG 600 
GAGGTCACGC AACTTGGGTA AGGTCATCGA TACCCTCACG TGTGGCTTCG CCGACCTCAT 660 
GGGGTACATC CCGC7CGTCG G7GC7CC777 AGGGGGCGC7 GCCAGAGC7C 7GGCGCATGG 720 
7G7CAGAG77 C7GGAAGACG GCGTGAA77A 7GCAACAGGC AACC77CCCG G77GC7CT77 780 
77C7A7CT7C T7GC7TGCCC 77C7A7CC7G CC7GACAG7C CC7GC77CGG CCG7CGGAG7 840 
GCGCAAC7C7 7CGGGGG7G7 ACCA7G7CAC CAA7GA77GC CCCAATGCG7 C7G77G7G7A 900 
CGAGACAGAG AGCC7GA7CA 7ACA7C7GCC CGGG7G7GTG CCC7GCG7AC GCGAGGGCAA 960 
CGCC7CGAGG 7GC7GGG7C7 CCC77AG7CC 7AC7G77GCC GC7AAGGA7C CGAGCG7CCC 1020 
CG7CAG7GAG A7rCGACGCC A7G7CGACCT GATTG7CGGG GCCGCTGCG7 7C7G7TCGGC 1080 
TATG7ACGTA GGGGACC7A7 GCGGCFCCAT C77CC7CGT7 GGCCAGA777 TCACCCTCTC 1140 
7CCCAGGCG7 CAC7GGACGA CGCAGGAC7C 7AA77G77CC ATC7ACCCAG GCCA7G7GAC 1200 
AGGTCATCGA A7GGC7TGGG ACA7GATGA7 GAAT7GG7CA CC7AC7GGCG CCC7AG7GG7 1260 
GGCGCAGC7A C7CCGGATCC CACAAGC7G7 CG7GGA7ATG ATAGCCGG7G CCCACJGGGG 1320 
7G7CCTAGCG GGCCTGGCA7 ACFA77CCA7 GG7GGGGAAC 7GGGC7AAGG T7G7GG77G7 1380 
GCTGCTACTT 77TGC7GGCG TCGA7GCAGA GACCCAGGTC 7CAGGAGGC7 CCGC7GCCCA 1440 
AACCACGTAC GG7CT7AC7G CCC7C77CAG GACAGGCCCT AA7CAAAAAA TCCAGCTCAT 1500 
AAATACCAAC GGCAGCTGGC ATA7CAACAG GACCGCC7TG AAC7GTAATG AGAGCT7GCA 1560 
CACCGGC7GG CTGGCAGCGC 7G77C7ACAC CCACAAGTTC AACTCTTCGG GG7G7T7GGA 1620 
GAGGATGGCC AGTTGCCAGC C7C777CCGC C77CGACCAA GGG7GGGGAC CCA7CAC77A 1680 
CGGGGGGAA7 GCTAGCGACG GCCAACGGCC A7A7TGC7GG CAC7ATGCCC CACGCCCG7G 1740 
CGGrA77G7G CCGGCGAGAG AGG77 1765 



(19) #^¥6-3 1956 3 

35 36 

[0036] 

E*'J#^ : 5 

: 1 1 7 9 

mmm : m* 

MO® : 

KM^OSH : c D N A to genomic RN A (SR037-3* cONA) 

AACAG7CAC7 GAGGCTGATA TCCGCACGGA GGAAGACCTC TACCAATCTT GTGACCTGGT 60 
CCC7GACGCC CGCACGGCCA TAAGGTCCCT CACAGAGAGG CTTTACATCG GGGGCCCGCT 120 
TACCAATTCr AAGGGACAAA ACTGCGGC7A TCGGCGATGC CGCGCAAGCG GCGTGGTGAC 180 
CACFAGCTGC GGTAACACCA TAACCTGTTA TCTCAAGGCC AGTGCAGCCT GTCGAGCTGC 240 
AAAGCTCCGG GACTGCACTA TGCTCGTGTG CGGCGATGAC CTTGTCGTTA TCTGTGAGAG 300 
CGCCGGTGTC CAGGAGGACG CTGCGAGCCT GAGAGCC7TC ACGGAGGCTA FGACCAGG7A 3$0 
C7CTGCCCCC CCGGGAGACC CGCCTCAACC AGAATACGAC TTGGAGCTTA TAACATCCFG 420 
TTCCTCCAAT GTGTCAGTCG CGCACGACGG CGC7GGCAAA AGGGTCTACT ATCTGACCCG 480 
FGATCCfGAG ACCCCCCTCG CGCGTGCCGC TTGGGAGACA GCAAGACACA CTCCAGTGAA 540 
CTCCTGGCTA GGCAACATCA TTATGT7TGC CCCCAC7TTG 7GGG7ACGGA 7GGTCC7CA7 600 
GACCCAT77T TTC7CCA7AC TCAFAGCCCA GGAGCACC7T GAAAAGGCTC 7AGA77GTGA 660 
AATC7A7GGA GCCGTACAC7 CCA7CCAACC GC7GGACC7A CCTGAAA7CA 77CAAAGAC7 720 
CCACGGCC7C AGCGCG7T77 CGCTCCACAG 77AC7CTCCA GG7GAAA7CA A7AGGG7GGC 780 
7GCA7GCCTC AGAAAACT7G GGGTTCCGCC CTTGCGAGCT 7GGAGACACC GCGCCCGGAG 840 
CG7CCGCGCC ACAC7CC7A7 CCCAGGGGGG GAAAGCCGC7 A 7A7GCGG7 A. AG7ACCTC77 900 
CAAC7GGGCG GTGAAAACCA AAC7CAAACT CACrCCAFfA CCG7CCGCGT C7CAGTTGGA 960 
CT7GTCCAA7 7GGT7CACGG GCGGC7ACAA CGGGGGAGAC A77TATCACA GCGTGTCTCG 1020 
7GTCCGGCCC CG7TGGTTC7 7C7GGTGCC7 AC7CC7AC7C 7CAG7GGGGG TAGGCA7C7A 1080 
7C7CC77CCC AACCGA7AGA CGG7TGGGTA ATCACTCCAA GCC777AGGC CC77TT7AAA 1140 
CAC7CCAGGC C77T7GGCCC 7G77T77777 T777TTT7T 1179 

[00 3 7] 6 *ffl<0|ft:-*ffl 

ie?'JcO^§ : 2 0 40 h--Ko^'- : Wm 

W>\<?)$L:mk U¥\<r>mU: cDNA to genomic RNA 

mm ftmmi tttm ■. e 

h^ni/- : mm® ■ IB^J (#186) 

ffi^lWfiS : c DNA to genomic RNA AYGTACCCC A YGAGRTCGGC 20 
Wi&£t&£U:;frft : E (Yi±T;*fc{ic . RliGifcliA) 

KW<#321) [0039] Wm^: 8 

AACCTCCGCC GGGGATC AGA 20 W3\<T>&i$ : 2 0 

[0038] wmm^-. 7 wnem-im 

: 2 0 $C9§$ : —1fS& 

EHttShttR *50 K-Kny-ifiltR 



genomi c RN A 



20 



37 

E^'l^fUS :cDNA to genomic RNA 
Hffi^LtJS : E 
mt ( # 2 5 6 ) 

CGCGCGMCNA GG AARRCTTC 20 
(M<iA£fcliC . NteA. T. CifcJiG . RJi 
AttdtG) 

[0040] m$m^ : 9 
mw-gz ■■ 2 o 

E?i|?>S : SB 

mm ■. -*m 

E^JWSBS: cDNA to 
E0f(#lO4) 

AGRAARRCTT CSGAGCGRTC 
(RttGifcliA . StiCifcttG . ) 
[004 1] EW#*I : 1 0 

SMOfiS : 2 0 

E?iJc7)S : gift 

««« : -*« 

EM^SSB: cDNA to genomic RNA 

*fffl£fcseLfcari£ : E 

E?'l < # 1 3 2 ) 

YRCCTTGGGC ATAGGCTGAC 2 0 
(YJiTifcliC . RI4GifcttA . ) 
[0042] SE?lJ#^ : 1 1 

SJlJWfiS : 2 0 

mmm ■. mm 
mm ■■ -*« 

Effl0)ffl%: cDNA to genomic RNA 
*ffi*fc£Lfc#i£ : E 
EM < # 1 3 3 ) 



to genomic RNA 



20 



(20) ftHPP6-3 19 563 

38 

GARCCAWCCT GCCCAYCCYA 20 
(RUGttziiA . WUT£fciiA . YI*C&fcf* 
T . ) 

[0043] £H*9: 12 
EM** 3 : 2 0 
EM*>£ : 

mm ■. -*m 

Hffltofflfr : cDNA 
10 «8*ftSgLfc*S£: E 
EM (#1 34) 

CCAARAGGGA CGGGARCCTC 
(RliGtttiA . ) 
[ 0 0 4 4 ] E?IJ#^ : 1 3 
E*J*>£5 : 2 0 

mm -*« 

h-ifny- : TttSS^ 
EM<7)8S : c DNA t o 
20 «S*ifeSeLfc2F8;: E 
EW (#1 35) 

RCC YTCGTTT CCRTAC AGRG 
(RttGifcJiA . YJiCifcliT . ) 
[0045] SW»f - : 1 4 

E?lJ<7>;K£ : 2 0 

wm^m ■ wm . 

EM?)?!® : cDNA to genomic 
30 #®£ifc£L*:2f&: E 
EM (#296) 

GGATAGGCTG ACGTCTACCT 
[004 6] 



genomi c RNA 



20 



RNA 



20 



(21) ^^6-319563 

39 

Kfl*^ : 1 5 
Ei«*>£$ : 3 0 1 1 

ie^J<?3«« : Sfift (HC-G9 aiino acid) 

Het Sep Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr 
5 10 15 

Asn Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He 
20 25 30 

Val Gly Sly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Val Gly 
35 40 45 

Val Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly 
50 55 60 

Arg Arg Gin Pro Ho Pro Lys Ala Arg Arg Pro Glu Gly Arg ser 
65 70 75 

Trp Ala Gin Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly 
80 35 90 

Cys Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg Gly Ser Arg Pro 
95 100 105 

Ser Trp Gly Pro ser Asp Pro Arg Arg Arg Ser Arg Asn Leu Gly 
110 115 120 

Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Het Gly 
125 130 135 

Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg Ala 
140 145 150 

Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
155 160 165 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala 
170 175 180 

Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Val Gly Val Arg 



